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Abstract 



In this paper, the relevance of the Feller conditions in discrete time macro-finance 
term structure models is investigated. The Feller conditions are usually imposed on a 
continuous time multivariate square root process to ensure that the roots have nonneg- 
ative arguments. For a discrete time approximate model, the Feller conditions do not 
t give this guarantee. Moreover, in a macro-finance context the restrictions imposed might 

^_^ be economically unappealing. At the same time, it has also been observed that even 

Li^ without the Feller conditions imposed, for a practically relevant term structure model, 

r^ negative arguments rarely occur. Using models estimated on German data, we compare 

the yields implied by (approximate) analytic exponentially afline expressions to those ob- 
tained through Monte Carlo simulations of very high numbers of sample paths. It turns 
out that the differences are rarely statistically significant, whether the Feller conditions 
O^ are imposed or not. Moreover, economically the differences are negligible, as they are 

always below one basis point. 
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1 Introduction 



This paper addresses the necessity of the so-called Feller conditions for an affine multivari- 
ate term structure model with time-varying volatility. Although already interesting from 
a theoretical point of view, the main motivation behind this study stems from sometimes 
unattractive implications of the Feller conditions in macro- finance models. A term structure 
model in general involves one or more driving factors, which are usually assumed unobserv- 
able. In the macro-finance models, the driving factors involve macro-economic variables, for 
instance the inflation rate. As these driving factors have a direct economic interpretation, 
the restrictions implied by the Feller conditions can be unappealing from an economic point 
of view. Consequently, most macro-finance term structure models assume a constant volatil- 



ity for the driving factors, see for instance Campbell and Viceira (2002) Ang and Piazzesi 



(2003) Ang and Bekaert (2004) Dewachter, Lyrio and Maes (2004, 2006), Fendel (2005) 



Bernanke, Reinhart, and Sack (2005) Dewachter and Lyrio (2006) [Hordahl, Tristani, and 
Vestin (2006) , Wu (2006) and Rudebusch and Wu (2007) ^ This however implies that interest 
rates are assumed symmetric, which means that either very low interest rates are predicted 
too often or very high interest rates not often enough. Especially for asset liability manage- 
ment purposes for pension funds, these characteristics can easily lead to wrong conclusions as 
the long duration of their liabilities makes them extremely sensitive to interest rate changes. 

The Feller conditions serve several purposes in a continuous time affine term structure 
model with time-varying volatility. First of all, the dynamics of the factor processes involve 
square root terms and the conditions are sufficient to have the arguments of the square 
roots strictly positive. This in turn guarantees that the stochastic differential equations that 
describe the dynamics of the factor have a unique strong solution and that a closed form 
expression for the bond price can be obtained. 

In practice, one often works with discrete time models, which, for instance, can be ob- 
tained by discretizing a continuous time model. For a discrete time model, existence of 
(strong) solutions is clearly not an issue and the Feller conditions play no role in this con- 
text. Moreover, the Feller conditions applied to a discretized model are useless to guarantee 
that the square roots always have nonnegative arguments. Indeed, the standard normally 
distributed errors, that are used as inputs in these discrete time models, imply that at each 
time instant there is a positive probability that one or more of the arguments of the square 
roots become negative, regardless whether the Feller conditions are satisfied or notjj [^ In 
spite of all this, it is not uncommon to impose the Feller conditions on discrete time models, 
as they serve as approximations of continuous time models. 

In a two-dimensional setting with one volatility factor (denoted v), one of the Feller 
conditions imposes that for every point on the line v = 0, the deterministic part of the 



^An exception is Spencer (2004) who specifies a 10- factor model for the US yield curve, including one 
heteroscedasticity factor which is a linear combination of several macroeconomic variables. 



This has already been observed in 



Backus, Foresi, and Telmer (2001, p. 290) 



(one of the first papers with 



an affine model in discrete time) for a one-dimensional process, although curiously enough, in the same paper 
it is claimed that the multivariate Feller conditions are sufficient for nonnegative arguments. 



■^An alternative would be to assume a Poisson mixture of Gamma distributions (Dai, Le, and Singleton 
|2005[ ) for the volatility factor instead. For a macro-finance model, this is problematic however, as volatility 
is an unknown linear function of the underlying data. Imposing the volatility factor to be equal to one of 
the driving factors (for instance expected inflation) has the disadvantage that this factor is not allowed to be 
influenced by the other factors (for instance the real short term interest rate). 









\ x^;rx 




'^^^ir^ ^^ 




^"*^55^^4*^'fv^s=ii^^^^^7r^ i+^s^L* — 




^^^^^^^5^ 








"^^^^^^/"""^ 


i (1 2 ^"^T^^ ® 8 10 12 14 1 


4- 


drift v>o\driftv<0 ^;;5~-^^>0 



Figure 1: Example of a sample path of a discrete time ATSM for which the Feller conditions are not 
satisfied. 



process (the drift) is such that the volatihty becomes positive again. Although it is clear that 
this condition is necessary in a univariate setting, its significance in a multivariate setting 
is not obvious as the interaction between the factors limits the part of the line v = that 
is actually approached. To illustrate this, Figure [T] shows a typical trajectory of a two-state 
factor process for a discrete time model with one volatility factor, and where the Feller 
conditions are not imposed. Although the line f = is crossed once, this happens in the 
area where the drift of the volatility process is positive, which would be impossible in a 
continuous time model. In other words, the fact that v also assumes negative values is due 
to the discretization of the model, not because the Feller conditions are violated. 

All this motivates a study that sheds some light on the necessity to impose the Feller 
conditions. If one assumes for a continuous time model that all volatilities are positive 
(which happens if the Feller conditions hold), there is an exponentially affine formula that 
expresses bond prices in terms of the state factor. The coefficients in this formula can be 
expressed in the model parameters. For a discrete time model, one can algebraically derive 
a similar expression, but again under the assumption that the square root factors are always 
positive. Since the latter does not hold (even when the Feller conditions are imposed), an 
exponential affine formula in discrete time has to be considered as yet another approximation 
for the bond price rather than an exact expression. 

In order to investigate the accuracy of such an approximation, we compare them with an 
alternative method for validating bond prices, by making use of Monte Carlo simulations. 
This will be done for a multi-factor discrete time term structure model with expected inflation 
and the real short rate as factors. Such a model will be estimated using European data. 

Since expected inflation is not observed, we use the Kalman fllter combined with a like- 
lihood approach to estimate the involved parameters. Estimation is performed for a number 
of cases, that will be referred to as models with independent volatilities, dependent volatilities 
and proportional volatilities. Each of these models will be estimated, with and without the 
Feller conditions imposed. In a pure latent variable model, it is usual to impose these condi- 
tions by assuming a canonical form, as in Dai and Singleton (2000)] In such a canonical form 



the volatility factors are equal to some of the state factors. However, we cannot do this for 
a macro-finance model as ours, since none of the factors can be taken as a volatility factor 
a priori. Therefore we will extract explicit parameter restrictions from the Feller conditions 
for our non-latent variable model. 

Having executed the estimation, we compare two consecutive approaches to validate bond 
prices. In the first one, we calculate the bond prices directly given the estimated parameters, 
using the Riccati equations (thereby ignoring the cut-off of volatility at zero). In the second 
approach, we perform a high number of Monte Carlo simulations of the trajectories of the 
factors, whereby volatility is restricted to be nonnegative. The mean of these simulations 
gives a second approximation of the bond price. Moreover, we can measure the approximation 
error with (sampled) confidence intervals. We will show that the differences between Monte 
Carlo results and the values obtained from the exponential affine formula are almost always 
negligible, both economically and statistically, whether the Feller conditions are imposed 
or not. From an economic point of view, the difference in implied yields between the two 
methods is hardly relevant, as it is at most one basis point. Statistically, the difference is 
only significantly different from zero for some maturities for the dependent and independent 
volatility models without Feller conditions. For the proportional volatility models, there is 
never a problem. 

The rest of this paper is organized as follows. In Section |2] we review general and affine 
term structure models in continuous time, mainly with the aim to set the notation for the sec- 
tions to follow. Moreover, we explicitly pin down the Feller conditions for a two-dimensional 
affine model, since the model is not given in canonical form. In Section [3] we discretize a 
continuous time model and show that the discretized model leads to the same expression for 
bond prices as a the discretized version of formula for bond prices in continuous time. In 
Section |4] we present and estimate our models. In Section |5] we use the estimated models 
of Section |4] to price bonds by Monte Carlo methods and compare the obtained results with 
those obtained by analytic methods. Finally in Section |6] we summarize our findings and 
draw some conclusions. 

2 Affine term structure models in continuous time 

Although we propose an affine term structure model for discrete time, we first discuss con- 
tinuous time models. The reason for this is that the mathematical theory for affine term 



structure models was initially developed for continuous time models (Duffie and Kan 1996), 
whence in this respect it is natural to regard the equations governing the discrete time model 
as discretizations of the continuous time equations. In particular the Feller conditions we 
are interested in apply to continuous time models only. In order to understand the necessity 
of this condition in discrete time, we ffist have to understand the reason why it should be 
imposed for continuous time and then we can check whether these reasons still apply af- 
ter discretization. Furthermore, since the Feller conditions use continuous time parameters, 
exact knowledge of the correspondence between discrete and continuous time parameters is 
required in order to impose it on the discrete time model. 

2.1 Short rate term structure models 

Let us first concisely recall some general theory of short rate term structure models, see 



Hunt and Kennedy (2000), Musiela and Rutkowski (1997) or Brigo and Mercurio (2006) for 



details. The formulas below will be used in subsequent sections. We assume that all relevant 
expressions are well defined. 

In a short rate term structure model the price Dt_T of a zero-coupon bond at time t 
maturing at T is based on the dynamics of the short rate r through the formula 

Di,T = EQ(exp(- / r,ds)|.Fi), (1) 

with Q the risk-neutral measure and (J-'t) the underlying filtration. 

Typically, in a short rate model one chooses r to be a function of a (possibly multi- 
dimensional) process X which satisfies a stochastic differential equation (SDE) 



dXt = /x(t, Xt)dt + a{t, Xt)dW[ 



t 5 



with W^ a multivariate Brownian motion under the risk-neutral measure Q and one writes 
rt = r{Xt). 

Under rather general conditions there exists a strong solution X to this equation which is 
Markov. In this case the bond-price can be written as Dt^T = EQ(exp(— J^ r{Xs)ds)\Xt) =: 
F{t, Xt) for some function F. If F is smooth enough, then it solves the fundamental partial 



differential equation (PDE), also called term structure equation (see Musiela and Rutkowski 



(1997, Chapter 12) or Vasicek (1977)[ where the latter terminology was introduced) 

^F(t,x)+£F(t,x)-r(x)F(t,x)=0, F{T,x) = l, (2) 

with 



■7 1 -^ 






the generator of X, where a means the transpose of a. 

The physical measure P is equivalent to the risk-neutral measure Q and related via a 
density process L by Lt = ^ . The process L can often be written as an exponential 



t 1 rt 



process £{Y ■ W^) for some y, i.e. 

Lt = exp{l YjdW^-^ I YjYsds), (3) 

Jo ^ Jo 

where 1^ is usually called the market pri ce of risk. According to Girsanov's theor em Wf = 
Wf^ — Jq Ygds is a P-Brownian motion, see Karatzas and Shreve (1991, Section 3.5) for details 



on absolutely continuous measure transformations. Using these relations one can write the 
SDE for X under the physical measure P: 

dXt = {ii{t, Xt) + a{t, Xt)Yt)dt + a{t, Xt)dWf . (4) 

2.2 AfRne term structure models 

Affine term structure models (ATSM's) are examples of short rate models and were introduced 



by Duffie and Kan (1996) We will give an overview here and explicitly point out where the 



Feller conditions are needed. 



In an ATSM the short rate r is an affine function of X, i.e. r = 60 + 5 X for some Sq £ M, 
6 G M", and X satisfies under Q an n-dimensional affine square root SDE 

dXt = {aXt + b)dt + ^ ,Jv{Xt)dW^ . (5) 

Here W'^ is an n-diniensional Brownian motion, v{Xt) is a diagonal matrix with on its 
diagonal the elements of the vector 

dmg[v{Xt)]=a + (5Xt, (6) 

with a G R"'^^ (3 G W^"^ (so Vii{x) = Oj + /3jX, with j3i the i-th row vector of 13). We will call 
these elements volatility factors and we write Vt '■= v{Xt) and Vi^t '■= Vi{Xt). For brevity, we 
denote by \/Vt the matrix with on the diagonal the square roots ^/Vi^t V 0, that is the square 
root of the maximum of Vi^t and 0. We will also use the notation Vi V for the diagonal 
matrix with elements Vi^t V 0. Notice that {^fVtf' = Vf V 0. 

Since the diffusion function x 1-^ S ^v{x) is not Lipschitz continuous for those x for which 
v{x) = 0, we cannot apply standard results to assure existence and uniqueness of a strong 
solution for d5|, unless Vj > almost surely. The multivariate Feller conditions as given 



m 



Duffie and Kan (1996) and treated in section 2.3 are sufficient for this. Thus we have 
encountered the first reason to impose the Feller conditions. 

In affine term structure models it is often desired that the process X also satisfies an 
affine square root SDE under the physical measure P, which considerably restricts the choice 
for the market price of risk Y . We only consider the so-called completely affine model, which 



means that we take Yj = yT^A with A G M", and we refer to Duffee (2002) for other options. 
In this case the SDE (H]) takes the form 

dXt = (aXt + b + i:{^/VtfX)dt + J^y^tdWf. (7) 

Under the condition that Vt > (elementwise on the diagonal), it holds that (VT^)^ = ^ 
and Equation ([7| reduces to 

dXt = (aXt + 6) dt + S ^/VtdWf , (8) 

with a = a + T,{(3 Q X), b = b + T,{a Q X)]q The affine structure of (|8J) is thus valid under the 
Feller conditions, which provides the second reason to have it imposed. 

The third reason for imposing the Feller conditions is of more practical importance, as it 
concerns the closed-form expression for the bond price. Indeed, under these conditions it is 
possible to solve the term structure equation ^ by 

F{t,x)=ex.p{A{T-t) + B{T-t)^x), (9) 

for t G [0,T] and x G D := {x £ M" : Vi{x) > 0,Vi}, where A and B satisfy the Riccati 
ordinary differential equations (ODE's) 

A' = b'^B + ^a'^{^'^Bf'^-5o, ^(0) = 0; (10) 

B' = a'^B + -(3'^{^'^Bf^-6, 5(0) = 0, (11) 



*The Hadamard-product denotes entry-wise multiplication, i.e. {v w)ij = VijWij for m x n-matrices 
V and w. We abbreviate v Q v to u®^. Furthermore we use the Hadamard-product also when 11 is an m- 
dimensional vector (instead of an m x n-matrix) and w an m x n-matrix (and vice versa), so {v w)ij = ViWij 
(or {v Q ■w)ij = VijWi in the other case). 



see Duffie and Kan (1996) However, if the bond price Dt^T equals F{t,Xt), then it is 
necessary that Xt € T> almost surely, since the domain of F is [0, T] x T>. In other words, we 
need that Vi^t > almost surely, for all i and t. 

2.3 Multivariate Feller conditions in two dimensions 

We will now treat the multivariate Feller conditions for positive volatility factors, as given in 



Duffie and Kan (1996) 



Proposition 1 (Duffie and Kan (1996) |. Let X be a solution to the affine square root 
SDE 1^. Then Xt ^ Vj^t > holds almost surely under Q if the multivariate Feller 
conditions hold, that is for all i, j we hav^ 



PiT,^ =0 or Vi = Vj + c for some c > 0, (12) 

(3i{ax + b)> ^Pi^^'^pJ for all x G dV\ (13) 

where we write T> := {x £ M" : Vi{x) > 0,i = 1, . . . ,n}, 92?* := {x G M" : Vi{x) = 0,Vj{x) > 
0, Vj 7^ i} and T,^ for the j-th column vector of T,. 

In a pure latent variable model the Feller conditions can be imposed by assuming a 
canonical form for SDE (p|, as shown by Dai and Singleton (2000) In such a canonical 



form the volatility factors are equal to some of the state factors. However, we cannot do 
this for macro-finance models as requiring one of the factors to be the volatility factor is 
overly restrictive. Therefore we need to extract explicit parameter restrictions from the 
Feller conditions in order to impose it for a non-latent variable model. We will do this for 
dimension 2. 

Let X be a solution to the SDE ([5| for n = 2. We distinguish three cases: proportional 
(linear dependent) volatilities, linearly dependent but non-proportional volatilities and linearly 
independent volatilities. The first case is characterized by V2 = kvi for some k >0, the second 
case corresponds to V2 = A;vi -|-c with A; > 0, c > and in the third case one has det /3 7^ 0. For 
the first and second case we take k = 1, i.e. k is absorbed in S, so that we can apply the above 
proposition. For future reference we also introduce 71 = (— /?i2,/5ii) and 72 = (/322) — /32i), 
where the Pij are the elements of the matrix /3. 



Proportional volatilities: Since we take V2 = kvi with k = 1, condition (12) is auto- 
matically satisfied. Hence, for Proposition [I] to hold, we only have to impose (13). 
Note that x G 92?^ = dD^ if and only if x = —j^^iPi + yij , for some y G M, where 



\Pi? 



\Pi\ denotes the Euclidean norm of the vector (3i. Equation (13) for i = 1 becomes 
Pi{-\^aPj + ya'yj + b) > \PiT.T^ 01 for ah y G M. This reduces to the following set 
of conditions 



\pi\ ^ 



PiaPl + Pib > -/3iEE ' PI (14) 

PiajJ = 0. (15) 



'In 



Duffie and Kan (1996) 



it is assumed that c = 0, but it is not hard to see that we can take c > as well. 



Dependent but unproportional volatilities: In this case U2 = t^i + c, with c > 0. Then 
condition (12) is automatically satisfied for i = 2, j = 1, but for i = 1, j = 2 we have 

^2 



to impose the extra condition 



AS^ 



0. 



(16) 



Note that dV^ = 0, so for condition (13) we only have to consider the case i = 1. The 
analysis is completely the same as for the case of proportional volatilities. Hence the 
conditions of Proposition [I] are equivalent to the set of conditions (14), (15) and (16). 

Independent volatilities: Suppose det/3 ^ 0, then /3~^ exists. Obviously neither V2 = 
vi + c nor vi = V2 + c holds true for some positive c, so for condition (12) to hold, we 
need to impose the restrictions 



AS^ = 0, 



(17) 
(18) 



/32Si = 0. 

Note that x G dD^ , respectively x G dD^ , if and only if a + (3x G {0} x ]R>o respectively 
a + (5x ^ M>o X {0}. Hence condition (13) is satisfied if and only if 



I3i (a(i-^ [{^ -«)+&)> ^/3iSS^/37, for ah w; > 0, 
/32 (a/3"^ ((o) -«)+&)> ^/32SST/3j, for ah ?; > 0. 



(19) 
(20) 



Since 



r^ 



1 / fe 

det/3 1-/321 



-/3i2' 



det/3 



(72^ 77), 



we can reduce the restrictions (19) and (20) to 

detp 2 

detp 2 

These hold true if and only if 



for all w >0. 
for all V > 0. 



PiajJ 
det/3 

/?2Q7j 
det/? 



>0 

> 

1 



f3ib - pia(3'^a > ^/3iSS^/37 
f32b-p2ar^a>^p2^^'^pj- 



(21) 
(22) 
(23) 
(24) 



The first two are necessary since v and w can be chosen arbitrarily large, while the latter 
two follow by choosing v = w = 0. In conclusion we can say that the requirements in 
Proposition [I] are met, if the conditions (17), (18) and (21)-(24) hold. 



It is worth noting that Xt € 'Dj'^t € [0, T] holds almost surely under Q if and only if it holds 
almost surely under P, by the equivalence of Q and P. Furthermore, X solves ([5]) if and only 
if it solves ([7]) , and under the conditions of the proposition it also solves d8| . Hence one can 
rephrase the conditions of the proposition by using the parameters of ([8| instead of those of 
(p]), which gives the alternative to (13), but under (12) equivalent, condition 

Pi{ax + b) > ^ASS^/37 for all x G dV\ (25) 



Consequently, under P the Feller conditions are also fulfilled under restrictions (14) to (24), 
with a and b replaced by a and b. 

3 Affine term structure models in discrete time 

In this section we take Equations ([I]), ([s]), ([8|, ([9]) as point of departure and transform it into 



their discrete time counterparts, using the Euler method (Kloeden and Platen 1999). Next 
we investigate whether the resulting equations are consistent with each other and in which 
sense the Feller conditions are necessary in this respect. 

In order not to complicate notation, we assume a discretization factor equal to one. We 
write Pn,t for the bond price at time t maturing at time t + n (which corresponds to Dt^t+n) 
in continuous time). Basically, all continuous time formulas are translated to discrete-time 
by replacing integrals by sums and substituting A for d. By the properties of the Brownian 
motion we have AW^^ = W^-^ — W^ ~ A^(0, /) for each t, and all these increments are 
mutually independent. Therefore we write efj^i instead of dW^, with £{_^i i.i.d. standard 
normal variables under the risk neutral measure Q. For the filtration we choose the natural 
filtration !Ft = (^{^t '• ^ ~ 1; • • • ; 0- We assume that Q and the physical measure P on J^t are 
related by dF = LtdQ, with L the discretized exponential process 

L, = exp(^ATv^e^^, - ^^AT(T4 VO)A). 

A:=0 k=0 

In the continuous time case, W^ defined by dW^ = dWf^ — y/VtXdt is a Brownian motion 
under P according to Girsanov's theorem. Analogously, in discrete time, one can show that 
the ef defined by e^_^_i = ef^-y — y/VtX are i.i.d. standard normal variables under P. So, we 
replace dW^ in (jsj with e^. 

Application of the above substitutions enables us to transform the continuous time model 
into a discrete time model. The two stochastic differential equations ^ and ^ under Q and 
P respectively, transform into 

Xt+i = {I + a)Xt + b + J:y^tef^^ (26) 

Xt+i = {I + a)Xt + b + ^^/Vte^+^. (27) 

The bond price formula M becomes 

n-l 

Pn,t = EQ(exp(-^ri+fc)|.Ft). (28) 

fc=0 

Finally, the closed form expression (|9| for the bond price in continuous time corresponds 
to F{n, x) = exjp{An + B^x) in discrete time, with n = T — t and An and Bn the Euler 



discretizations of the solutions of the ODE's (10) and (11). The latter means that A^ and 
Bn satisfy the Riccati recursions 



Bn+1 = (/ + a)^Bn + ^/3T(ST5„)Q2 _ 5^ 



^0 = 0; 
Bo = 0, 



which are equivalent to 



An+l =An + (b- S(a X))^Bn + -cJ{T^Bnf^ - 60, 
Bn+1 = (/ + a - S(/3 \))^Bn + ^/3^(ST5„)®2 _ 5, 



(29) 
(30) 

Ao = 0; (31) 
Bo = 0. (32) 



Now that we have derived the discrete time equations, it is important to note that it 
is impossible to prevent the volatility factors Vt^i from becoming negative, since the noise 
variables are normally distributed. So in this respect it is useless to impose the Feller condi- 
tions. Does the possibility of negative volatility factors lead to any consistency problems for 
our discrete time model? We saw that in continuous time there were three reasons to have 
positive volatility factors. The first reason concerned existence and uniqueness of a strong 
solution for the SDE, which is not an issue in discrete time. The second reason was for 
writing {\/Vt)'^ as Vt, which enabled us to write ([7|) as ([8|), an affine square root SDE under 
the physical measure. Since in discrete time Vt can always become negative, in this case it 
does not hold true that (\/l4)^ = Vt, which implies that the dynamics of X given by (26) 
and (27) are not consistent with each other. Recalling the definition of e^, we see that there 



are two possibilities to solve this problem, either we keep (26) and replace (27) by 



Xt+i = (/ + a)Xt + b + ^{Vt V 0)A + SyV^ef+i, 



or we keep (27) and replace (26) by 



Xt+i = {I + a)Xt + b-^{Vt V 0)A + ^VVte^ 



+1- 



(33) 



(34) 



We opt for the latter, because an attractive expression under the physical measure is prefer- 
able in view of estimation of the parameters. 

The third reason why we needed positive volatility factors in continuous time was to 
solve the term structure equation ([2| in order to obtain a closed form expression F for the 
bond price. There is no discrete time analogue of a term structure equation. However, using 
induction and the properties of a log-normal variable, we can algebraically derive a closed 
form expression for the bond price in discrete time, and, just as in continuous time, we need 
positiveness of the volatility factors for this, see Proposition [2] below. Remarkable is that 
this leads to the same expression as F, the discretization of the closed form expression F in 
continuous time. This is summarized in Figure [2] in a commutative diagram. 



Proposition 2. Let X satisfy (26). Then for Pn^t given by ^2^ it holds that 



Pn,t > F{n, t) = exp(A„ + BjXt), (35) 

with equality iff Vt > almost surely for all t. The scalars An and vectors Bn satisfy the 



Riccati recursions (29) and (30). 
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Proof We give a proof by induction. For n = it holds that P^^t = Po,t = 1, so the statement 
holds true with Aq = Bq = 0. Now suppose Pn~i,t > exp(^„_i + Bj_iXt) for all t and for a 
certain n G N. We write 



n,t 



Er 



n-1 



exp Yj ^*h 



fc=0 



^t 



EQ[Pn-i,t+ie-'-'\Tt] 



and use the induction hypothesis to get 



Pn,t > EQ[exp(A„_i + Bj,_,Xt+i -So- 5'^ Xt)\J^t] 

= EQ[exp(A„_i + Bj_,{{I + a)Xt + b + ^VVte%,) - 6o - 6^ Xt)\rt] 

= eMAn-i + B^_^b -6o + ((/ + a)^5„_i - 6)^ Xt)EQ[eMBI-l^^Me?+l)\:Ft] 

> exp(A„_i + B^^^b -6o + ((/ + a)'^Bn-i - S)^ Xt + ^B^_^^VtJ:'^ B^^i) 
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exp { ^„_i + 5^_i6 - ,5o + -a ' (S ' 5„_i 



+ ((/ + a)Tfi„_i - 6 + -/3T(sTi?„_i)®2)T^^ J ^ 



^n 



where we have used that (with £ = S^i?„_i 



EQ[exp{l ' V^teS-i)l-^t] = expi^i'^iVt V 0)£) > expC^^^y*^) 



(36) 



and 



i^Vti = i^{a Qi)+ i^{(3Xt Qi) = a^{i Q i) + {pXt)'^ {£ Q i) = a^ i®^ + Xj (3^ l®^ 

= aT^02 + (/3T^02)T^^^ 



If Vi > then the inequality in (36) is an equality. This proves the assertion. 



D 



Of course, the probability that Vt gets negative is positive for all i, so Pn,t is not equal to 
-F(n, Xt). Note also that the above proposition is inapplicable for our model, as X solves (34) 



instead of (26). However, from a heuristic point of view, if Vt gets negative with very small 



probability, (34) and (26) are "almost" equivalent and the inequality in (36) is "almost" an 



equality. This suggests that F might be a good approximation for P. 

A priori though, it is not clear whether a model with parameters that satisfy the Feller 
conditions yields a better approximation than a model without parameter restrictions, as we 
do not know how the Feller conditions affect the probability for negative Vt in discrete time. 
In fact, in the introduction we already saw an example in discrete time in which the relative 
frequency of negative Vt was rather small, without having the Feller conditions imposed. 

In the remaining sections we implement and estimate the discrete time model for dimen- 
sion 2 using real data and we investigate how well in this case F approximates P. This is 
done by comparing F with Monte Carlo computations for the bond price P, based on (28) 
and pi). 
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dXt = ii{Xt)dt + cr{Xt)dW^ 



AXt = fMiXt) + a{Xt)e'f^^ 



h^ 



discretize 
SDE ► Time Series 



dt 



F + CF -rF = 0, solve PDE 



Calculate P„,Pn,t = E [P„.i^t+ie-''t\rt] 



discretize 
ODE »■ Recursion 



d^ A 



A' = b'^B+-a'^(S'^Bf^^-5o 



AA„ = b'^Bn + iaT(2TB^)©2 _ 5^ 



B' = a^B+-f3'^{S'^Bf^^ -5 



AB„ = a'^Bn + i/3^(S^B„)®2 _ s 



Figure 2: Commutative diagram for calculating the bond-price in discrete time 

4 Implementation and estimation of the discrete time ATSM 

In this section, we investigate a two- factor model with the ex- ante real short term rate and 
expected inflation as state variables, the nominal short rate being the sum of these two 
factors. The interaction between interest rates and inflation is important, for instance for 
pension funds, as most of them have the intention to give indexation, whereas index linked 
bonds are hardly available. For the Netherlands, another important motivation for modeling 
this link is that supervision is geared toward nominal guarantees. 



4.1 Specification of the model 

Let Xi^t denote the ex-ante real short term rate at time t and X2^t the expected inflation. We 
use our dynamic model for quarterly data, so time is measured in quarters. Consequently, 
rt as used in the pricing formulas for bonds, is given in ordinary fractions per time unit, in 
our case per quarter. For numerical and readability reasons however, we want to express X 
in percentages per year. Therefore, we have rt = {Xi^t + -'^2,i)/400. 

With respect to inflation, we are primarily interested in the ex-ante expectation and not 
so much in past realizations. Let vTi+i denote the inflation rate from i to f -|- 1 (also in 
percentages per year), and X2^t its ex-ante expectation at date t. The observed processes are 
the short nominal rate rt and vrt. Since the inflation rate exhibits a seasonal pattern, we also 
include a seasonal contribution St in the model. 



Apart from the dynamics of the state process X as given in Equation (27), our model is 
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described by the following equations, that relate the state variables to the observations. 



n = (Xi,i + X2,i)/400 

TTt+l = X2^t + St+l + ti^TT V ^2,t-l?7r,t+l 

St+i = —St — St-i — St~2 + ^syy2,t-iis,t+i, 



(37a) 
(37b) 
(37c) 



where ^7r,t and E,s,t are standard normally distributed error terms, that are independent under 
the physical measure of ef (which is the error term in the equation for Xt). 

The data we are using for estimation are the observed longer maturity yields (denoted r„^f , 
measured in fractions per quarter). These are modeled by the exponential affine expression 
for the bondprice plus a measurement error, which is assumed to be independently identically 
distributed among maturities: 



'n,i •- 



[An + BnXt) /n+[vo + Ul^JVi^t~-l + y2^JV2,t-i] i. 



•,n,tj 



(38) 



under the restrictions Vi > 0, and where ^n,t ~ -^(0,/). 

Having fully specified the model, we turn to the estimation procedure. A complicating 



matter is that both factors are not observed. Therefore, the extended Kalman filter (Har- 



vey 19891 is used to estimate the models] In principle, all parameters can be estimated 



simultaneously. In practice however, a one-step procedure tends to lead to unrealistic ex- 
pected inflation predictions as the best fit for the bond prices are not necessarily achieved 
for the most realistic expected inflation estimates. As an appropriate modeling of the time 
series dynamics of interest rates and inflation is considered more important than the lowest 
measurement error for bond prices, we prefer a two-step procedure. In the first step, the 



parameters for system (37) are estimated, combined with the dynamics (27) for Xt and its 



volatility m. In the second step, the system is augmented by the equations for the long-term 
yields (38), and we estimate A and u, using the Riccati recursions (31) and (32), conditional 



on the first- step parameters. 



4.2 Estimation results 

The models are estimated with quarterly German data over the sample period 1959 to 2007. 
In order to estimate the dynamics between interest rates and inflation correctly, a long sample 
period is preferred. On the other hand, prices of zero coupon bonds are only available for 
a relatively short sample period, especially for longer maturities. Therefore, an unbalanced 
panel was used, with the short rate and inflation data starting in the last quarter of 1959, 
1, 2, 4, 7 and 10-year rates starting the third quarter of 1972, the 15 year rate from 1986:11, 
and the 30- year rate from 1996:1 on. 

Table[T]shows the estimation results for the models without the Feller conditions imposed. 
As only the conditional covariance matrix of the noise terms, which is given by S(Vi VO)S^, is 
identifiable, we fix En = 1 in all models, we choose Pi = P2 in the proportional and dependent 
models, we take S12 = in the proportional model, and S22 = 1 in the independent model. 



The extension is due to the variance equation, that includes state variables. Consequently, the true 
variance process is not known exactly, but has to be estimated as well. The resulting inconsistency does not 
seem to be very important, though in short samples the mean reversion parameters are often biased upwards, 



see|Lund (1997)||Duan and Simonato (1999)|[De Jong (2000)] [Bolder (2001)[pien and Scott (2003)[[Duffee 
and Stanton (2004)1 andlDe Rossi (2006) 
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Tabic 1: Estimation results without the Feller conditions imposed 



Proportional volatilities Dependent volatilities Independent volatilities 



-a-^b 



I + a 



a 



P 



- 2.36 

_ (2.9) 


3.05 1 
(4.5) . 




r 2.39 
. (3.0) 


3.03 1 
(6.3) . 




r 2.33 
. (2.0) 


3.12 1 
(5.2) . 


0.926 

(26.5) 

-0.002 

(0.1) 


0.087 
(1.7) 
0.938 
(24.0) 




0.948 

(26.3) 

-0.026 

(0.9) 


0.111 

(2.8) 
0.950 

(25.5) 




0.946 
(24.4) 
-0.006 

(0.2) 


0.102 - 

0.940 
(24.3) J 


-0.377 
(2.3) 


-0.377 
(2.3) 




-0.373 
(12.2) 


-0.165 

(1.4) 




-0.377 

(3.2) 


-0.081 
(1.5) 


0.105 

(2.3i 
0.105 
(2.3) 


0.230 1 
(2.9) 
0.230 
(2.9) J 




r 0.108 

(4.7) 

0.108 

L (4.7) 


0.194 1 
(10.4) J 




- 0.117 

0.015 
L (1.4) 


0.193 1 
(3.3) 
0.091 
(2.6) J 


1 

-0.257 
(1.5) 


- 

0.639 
(5.3) . 




- 1 

0.052 

- (0.5) 


-0.260 - 

(1.7) 
0.547 
(4.2) . 




- 1 

0.041 

- (0.3) 


-0.526 1 
(3.0) 

(-) - 


0.105 
(0.7) 


-0.129 ■ 
(1.3) . 




r 0.108 

[ (0.6) 


-0.136 1 

(1.4) J 




0.0524 
(0.3) 


-0.209 1 
(1.1) J 



Estimation sample 1959:IV - 2007:11 
Absolute two-step consistent t- values in parenthesis. 

The mean real short rate is about 2.4% per year, whereas the mean inflation rate is just 
over 3%, the values in the top row of Table [T| With respect to the interaction between the 
short real rate and expected inflation, the lagged response (a) is in accordance with economic 
theory. Higher expected inflation leads to higher real rates, whereas higher real rates depress 
future inflation. The latter effect is far from signiflcant though. With respect to volatility 
(q and /?), both higher real rates and higher expected inflation lead to significantly higher 
variances. 

Table |2]shows the results for the models that are restricted to fulfill the Feller conditions. 
In the independent volatility model, initially obtained estimates for S2i,/52i and 021 were 
practically zero. Therefore, a zero value was subsequently imposed to increase accuracy 
of the other parameters. In this model, higher inflation now has a negative (though not 
signiflcant) impact (012) on future short term interest rates, which is contrary to economic 
theory, whereas in the previous case when the Feller conditions were not imposed, we found 
for this coefficient a positive value. In the other models, the impact of infiation on lagged 
real rates (021) is now positive, which is also in contrast with economic theory. 

5 Monte Carlo results 

Figure |3] shows the approximation errors made by the analytical expressions, in terms of 
yields, for each of the six cases as presented in Tables [T] and [2} The Monte Carlo simulations 
are based one million sample paths (containing 200 quarters) for the state variables. The 
yields are computed assuming the initial state variables are at their equilibrium values, which 
were a real short rate of about 2.4% and expected infiation of just over 3%. 

Ignoring the fact that volatilities are cut-off at zero does not seem to be important. 
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Table 2: Estimation results with Feller conditions imposed 



Proportional volatilities Dependent volatilities Independent volatilities 



-a-^b 



I + a 



a 



P 



r 2.34 

. (2.4) 


3.04 1 
(4.3) . 




r 2.36 
. (1-9) 


3.04 1 
(3.3) . 




r 2.91 

. (2.4) 


2.83 1 
(3.8) . 


r 0.924 

& 

I (0.9) 


0.083 - 
(1.5) 
0.925 
(25.0) . 




- 0.933 
. (1.1) 


0.061 - 

(1.2) 

0.936 

(28.7) . 




0.974 
(52.4) 

(-) 


-0.009 
(0.4) 
0.958 
(47.9) 


-0.412 

(2.3) 


-0.412 
(2.3) 




-0.098 

(5.7) 


-0.062 

(1.4) 




0.020 
(0.7) 


-0.108 

(4.5) 


r 0.108 

(2.2) 
0.108 

L (2.2) 


0.252 1 
(2.9) 
0.252 
(2.9) J 




r 0.028 
(1.5) 
0.028 

L (1.5) 


0.049 - 

0.049 
(1.6) J 




0.071 
(2.) 

(-) 


-0.044 
(1.2) 
0.100 

(5.7) 


1 

-0.292 
L (1.8) 


■ 

0.640 
(5.6) . 




. (1.9) 


-1.620 

(3.1) 
0.915 

(2.0) 




r 1 
^0^ 

L (-) 


0.615 1 
(3^4) 

(-) - 


0.0050 
(0.0) 


-0.124 
(1.1) 




■ -0.153 
. (1.3) 


0.667 
(1.6) 




-0.397 

(1.2) 


-0.125 
(0.1) 



Estimation sample 1959:IV - 2007:11 
Absolute two-step consistent t- values in parenthesis. 

Indeed, the 99% confidence band for the maximum approximation error in terms of yields 
stays within plus and minus one basis point (0.01%) for all models. It does not make much 
difference whether the Feller conditions are imposed (second column) or not (first column). 
Zero is almost always included in the confidence band, except for some maturities for the 
dependent and independent volatility models without Feller conditions imposed. For the 
proportional volatility model, there is never a problem, whether the Feller conditions are 
imposed or not. 

It might be the case that these good approximations are due to the fact the approxima- 
tion errors are calculated for the equilibrium yield curve. If the initial state variables imply 
a volatility closer to zero, ignoring the cut-off at f = might be more serious. Therefore, we 
also calculated the approximation errors for those state variables for which volatility was the 
lowest in the past. Figure |4] shows the worst result we found. Indeed, for maturities up to 
15 years, the simulated yields are significantly higher than the analytical ones. The reason 
is that for the initial state variables, the volatility is cut-off at zero. As the state variables 



evolve according to (34), whereas the formulas underlying the formulas assume (26), system- 



atic differences arise. Moreover, as the simulated yields are almost deterministic for short 
maturities, the confidence band is extremely small. In economic terms, the approximation 
error is still negligible though (at most one basis point). 

Finally, as the Feller conditions do not guarantee positive volatilities in the discrete time 
model, imposing them does not preclude statistically significant approximation errors from 
arising either. Indeed for both the dependent and independent volatility models with Feller 
conditions we found starting conditions for which significant negative approximation errors 
for maturities up to 17 years occur. However, as the maximum magnitude of these errors is 
at most 0.5 basis point, the economic relevance is again negligible. 
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6 Conclusions 

The Feller conditions are imposed on a continuous time multivariate square root process in 
order to have well defined strong solutions to the stochastic differential equations and to 
ensure that the roots have nonnegative arguments. For a discrete time approximate model, 
the Feller conditions loose part of their relevance. Existence of strong solutions is not an 
issue anymore and since the noise involves standard normal errors, there is always a positive 
probability that arguments of square roots become negative. Nevertheless, keeping in mind 
the idea that a discrete time model is an approximation of a continuous time model, it is 
natural to still impose the Feller conditions. On the other hand, it has also been observed 
that even without the Feller conditions imposed, for a practically relevant model, negative 
arguments rarely occur. 

We have investigated the relevance of imposing the Feller conditions for a two-factor affine 
term structure model, where the factors (ex-ante real short term interest rate and expected 



Figure 3: Mean and 99% confidence interval of the difference between simulated and analytical yields 
if starting state variables are in equilibrium 
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inflation) are modelled as a square root process. As we want to allow volatilities to depend 
on both state variables, the models are not estimated in canonical form. Therefore we also 
explicitly presented the Feller conditions for square root models not in canonical form. 

Three different models have been used, that have been referred to as models with propor- 
tional, dependent and independent volatilities, either with or without the Feller conditions on 
the parameters. The parameters of each of the underlying models have been estimated using 
quarterly German data. The restrictions involved in imposing the Feller conditions resulted 
in unappealing economic results. In the proportional and dependent volatility models, the 
restrictions imply a positive impact of interest rates on inflation, whereas in the independent 
volatility model, inflation now leads to lower interest rates. Both elements are contrary to 
the accepted economic theory. 

For these six cases we have compared the resulting yields, that are either obtained by 
(approximate) analytic exponentially affine expressions or those obtained through Monte 
Carlo simulations of very high numbers of sample paths. It turned out that the approximation 
errors in analytical yields were rarely statistically significant, and never economically relevant, 
as they were always below one basis point. In particular a proportional volatility model 
without the Feller conditions imposed already gave very good results. 
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